Applications of the self-organising map to reinforcement learning
نویسنده
چکیده
This article is concerned with the representation and generalisation of continuous action spaces in reinforcement learning (RL) problems. A model is proposed based on the self-organising map (SOM) of Kohonen [Self Organisation and Associative Memory, 1987] which allows either the one-to-one, many-to-one or one-to-many structure of the desired state-action mapping to be captured. Although presented here for tasks involving immediate reward, the approach is easily extended to delayed reward. We conclude that the SOM is a useful tool for providing real-time, on-line generalisation in RL problems in which the latent dimensionalities of the state and action spaces are small. Scalability issues are also discussed.
منابع مشابه
An Empirical Investigation into Function Approximation with Reinforcement Learning
In the reinforcement learning framework, standard, table-based look-up methods for value functions converge to the optimal solution, yet unfortunately these methods are intractable for complex real-world control problems. A common approach to overcome this problem are so-called function approximation techniques that generalise over their input spaces. In this paper we study the capabilities of ...
متن کاملCritic-based Learning of Actions with Self-Organising Feature Maps
In this paper we develop a mechanism for critic-based learning in continuous state and action spaces. Our approach is based on the Motoric Map model [RMS90], by which we wish to overcome the restrictions of traditional Reinforcement Learning methods concerning continuous spaces. Covariance Learning is introduced as algorithm to determine the best possible action for a given state using the crit...
متن کاملIntroduction: new developments in self-organising maps
Self-organisation is a universal phenomenon observable in many natural systems: both animate and inanimate. It is often easier to recognise self-organisation than to define it. The origins of Kohonen’s self-organising map (SOM) was as a simplified model of the relatively homogeneous structures found in mammalian brains, associated with the processing of sensory data, that exhibit self-organisat...
متن کاملActive audition using the parameter-less self-organising map
This paper presents a novel method for enabling a robot to determine the position of a sound source in three dimensions using just two microphones and interaction with its environment. The method uses the Parameter-Less SelfOrganising Map (PLSOM) algorithm and Reinforcement Learning (RL) to achieve rapid, accurate response. We also introduce a method for directional filtering using the PLSOM. T...
متن کاملDecentralised Multi-Agent Reinforcement Learning for Dynamic and Uncertain Environments
Multi-Agent Reinforcement Learning (MARL) is a widely used technique for optimization in decentralised control problems. However, most applications of MARL are in static environments, and are not suitable when agent behaviour and environment conditions are dynamic and uncertain. Addressing uncertainty in such environments remains a challenging problem for MARL-based systems. The dynamic nature ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Neural networks : the official journal of the International Neural Network Society
دوره 15 8-9 شماره
صفحات -
تاریخ انتشار 2002